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Abstract. Methods for specifying Moore type state machines (trans- 
ducers) abstractly via primitive recursive string functions are discussed. 
The method is mostly of interest as a concise and convenient way of 
working with the complex state systems found in computer programming 
and engineering, but a short section indicates connections to algebraic 
automata theory and the theorem of Krohn and Rhodes. The techniques 
are shown to allow concise definition of system architectures and the 
compositional construction of parallel and concurrent systems. 
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1 Introduction 

The engineering disciplines of programming and computer system design have 
been handicapped by the practical limitations of mathematical techniques for 
specifying complex discrete state systems. While finite automata are the natu- 
ral basis for such efforts, the traditional state-set presentations of automata are 
convenient for only the simplest systems as well as for classes of systems, but 
become awkward when state sets are large, when behavior is only partially spec- 
ified, and for compositional systems. Furthermore, it would be nice to be able 
to parameterize automata so that we can treat, for example, an 8bit memory as 
differing from a 64bit memory in only one or a few parameters. These problems 
can all be addressed by using a recursive function presentation of automata that 
is introduced here. 

General automata have long been understood to be a class of functions from 
finite strings of input symbols to finite strings of output symbols [I] but for 
specifying computer systems it is more useful to consider functions from finite 
strings of inputs to individual outputs. The intuition is that each string describes 
a path from the initial state to some "current" state and the value of the function 
is the output of the system in the "current" state. If A is an alphabet of input 
events and X is a set of possible outputs, let A* be the set of finite strings 
over A including the empty string A and then a function / : A* — > X defines a 
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relationship between input sequences and outputs. These functions can be shown 
to be strongly equivalent to (not necessarily finite) Moore type automata[5] while 
abstracting out details that are not interesting for our purposes here. If a is an 
input and w is a string, wa is the result of appending a to w and by defining 
f(A) — xq and f{wa) — h(a,f(w)), we can completely specify the operation of 



Correspondence between a transducer M and a string function /. 

Input:w => Machine:M => Output:x 
f(w) = x 



It turns out that a type of simultaneous recursion can be used to specify 
automata products that model composition and parallel (and concurrent) state 
change. Suppose that /i, .../„ are previously defined string functions, fi : A* — > 
Xi and we wish to combine these into a system where inputs from some alphabet 
A drive the components forward. At each step an input a to the composite system 
will be used to generate an input sequence Zi for each component fi. The input 
sequence for the component is a function of both a and the feedback, the outputs 
of the components. The composition builds a new function / from fi . . . , /„ plus 
a communication map g and an output map h. Let f(w) = h(fi(ui) . . . , f n (u n )) 
where the Ui are themselves primitive recursive functions of / and w. I will 
write Ui when w is clear from context and use functional form Ui(w) other- 
wise. We always require that Ui(A) = A — so that in the initial state of the 
composite system every component is in its own initial state. Let w o z be the 
string obtained by concatenating w and z. The communication map is used as 
{o\lows:Ui(wa) = Ui(w) o g(i, a, f(w)). The idea is that appending a to w causes 
the string g(i, a, f(w)) to be concatenated to itj. 



Outline. In what follows, I'll give two examples of parallel composition and 
then make the correspondence between string functions and transducers precise 
and prove the correspondence between the simultaneous recursion scheme given 
above to a " general product" of automata. The concluding section looks at some 
implications for the study of automata structure and algebraic automata theory. 
Companion technical reports describe practical use. 

The two "factors" case is illustrative. 
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/ Input — > M\ — > Outputixi \ , 
Inputs -+ .g (^ Input:U2 ^ M2 ^ 0utput:X2 J -> Outputs 

<S= < feedback < : < : 



FW = A(/i(«i),/ 2 W) 
Wi(yl) = yl 
Ui(ioo) = Ui(w) o ff (a, i, 



Example: Stack By way of illustration consider a parallel implementation of a 
stack. 



Stack n (w) = (S( Ul )...,S(u n )) (1) 

where each S(za) = a so that the n factors are are simple storage cells. Let's 
have a special value so we can spot empty cells S(A) = EMPTY and have 
some a = EMPTY in the storage cell alphabet. The alphabet of the stack is 
PUSH[v] : v £ A s torage and POP . Then define the m 

Ui (A) = A (2) 

(v) if i = 1 and a = PUSH[v] 

{EMPTY) if i = n and a = POP 
(Siiii-^w))) if i > 1 and a = PUSH[v] 
k (S(ui +1 (w))) if i < n and a = POP 



Ui{wa) — Ui{w) o < 
Then define Top(w) — S(ui) and 



, , _ ( 1 if S(ui) = 
m P y( w ) — I q otherwise. 



1 if Sim) = EMPTY 



and 

' 1 if S(u n ) ^ EMPTY 



Full(w) 



otherwise. 



(3) 



Example: Network A computer on a network might, from the outside, appear to 
have an alphabet consisting of RECV[m], TRANSMIT[m} 7 for m in a set of 
possible messages and TICK to indicate passage of time. Say D is a networked 
computer if D(w) £ {(m, c) : m £ Messages U {A^[/LL},c £ {ready, busy})} 
where D(w) = (x, y) tells us that D is trying to send message x (or not sending 
any message if x = NU LL) and that D is or is not ready to accept a message. 
For simplicity assume a broadcast network and then define 



N(w) = (D 1 (u 1 )...,D n {u n ),R(v)) 
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where each A is a network node and R is an arbiter we can define to pick which, 
if any, node gets to send a message next. Each A may be distinct as long as it 
satisfies the specifications of output values. 

R(z) e{l...n} 

The alphabet of N can just consist of the single symbol TICK. Let Ui(wa) = 
Ui(w)o(RECV[m],TICK) if R(v(w)) = j and AyKH) = (m,c) and A («*(«>)) = 
(k, ready). Otherwise, just append TICK to Uj. 

If Dj is itself a product, say Dj(w) = (OS(r os ), APP(r app ) then if w is the 
string parameter to N, we can look inside at the value of OS(r os (ui(w))). 

2 Basics 

A Moore machine or transducer is usually given by a 6-tuple 

M = (A, X, S, start, S, 7) 

where A is the alphabet, X is a set of outputs, S is a set of states, start £ 5 is 
the initial state, <5 : S x A — > 5 is the transition function and 7 : 5 — > Xis the 
output function. 

Given M, use primitive recursion on sequences to extend the transition func- 
tion 6 to A* by: 

5*(s,A) = s and S*(s,wa) = S(S*(s,w),a). (4) 

So 7(5* (start, w)) is the output of M in the state reached by following w 
from M's initial state. Call Jm(w) — 7 (5* (start, w)) the representing function 
of M. 

If /a/ is the representing function of M, then f'(w) — g(f(w)) represents M 1 
obtained by replacing 7 with "f'(s) = 9(7(5)). The state set of M and transition 
map remain unchanged. 

The transformation from string function to transducer is also simple. Given 
/ : A* -> X define f w (u) = f(w o u). Let Sf = {f w : w £ A*}. Say / is finite if 
and only if Sf is finite. Define Sf(f w ,a) = f wa and define j(f w ) = fw(A) = f(w). 
Then with start f = /a we have a Moore machine 

M(f) = {Sf,start f ,6 f ,j f } 

and, by construction / is the representing function for M(f). 

A similar construction can be used to produce a monoid from a string function 
as discussed below in section IXT1 

Any M2 that has / as a representing function can differ from Mi = A4(f) 
only in names of states and by including unreachable and/or duplicative states. 
That is, there may be some w so that 5*(starti,w) ^ S 2 (start 2, w) but since 
fw — fw it must be the case that the states are identical in output and in the 
output of any states reachable from them. If we are using Moore machines to 
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represent the behavior of digital systems, these differences are not particularly 
interesting and we can treat M(f) as the Moore machine represented by /. 

While finite string functions are the only ones that can directly model digital 
computer devices or processed, infinite ones are often useful in describing system 
properties. For example, we may want L(A) = and L(wa) = L(w) + 1 and then 
seek to prove for some P that there is a to so that whenever L(woz) > L(w)+to 
there is a prefix v of z so that P(w ov) ~ 0. In this case, L is an ideal measuring 
device, not necessarily something we could actually build. 

2.1 Products 

Suppose we have a collection of (not necessarily distinct) Moore machines M, = 
(Ai, Xi,Si, start i, Si,Xi) for (0 < i < n) that are to be connected to construct a 
new machine with alphabet A using a connection map g. The intuition is that 
when an input a is applied to the system, the connection map computes a string 
of inputs for Mi from the input a and the outputs of the factors (feedback). 
The general product here is described by Gcseg [2] . I have made the connection 
maps generate strings instead of single events so that the factors can run at 
non-uniform rates. If g(i, a, x) = A, then Mi skips a turn. 

Definition 21 General product of automata 

Given Mj = (Ai, Xj, Si, starti, 5i, 7^) and h and g define the Moore machine: 
M = A? =1 [Mi,g, h] = (A, X, S, start, S, 7) 

- S — {(si . . . , s n ) : Si £ Xi} and start = (starti . . . , start n ) 

- X = {h(xi . . . ,x n ) : Xi € Xi} and j((si s„)) = /i(7i(si) . . . ,7„(s„)). 

- 5((s 1 . . .,s n ),a) = (6l(s!,g(l,a,j(s))) . . . ,5; i (s n ,g(n,a,j(s)))). 

One thing to note is that the general product, in fact any product of au- 
tomata, is likely to produce a state set that contains unreachable states. The 
string function created by simultaneous recursion represents the minimized state 
machine as well. The possible "blow up" of unreachable and duplicate states is 
not a problem for composite recursion although it vastly complicates work with 
state-set representations. 

Theorem 1 If each fa represents Mj and f(w) — h(fi(ui) . . . , f n (u n )) 
and Ui(A) = A 

and Ui(wa) = Ui(w) o g(i, a, f(w)) 

and M = Af =1 [Mi, h,g] then / represents M 

Proof: Each fi represents Mi so 

f l (z) = ll (S*(starU,z)) (5) 

1 There is confusion on this subject for reasons I cannot fathom, but processes execut- 
ing on real computers are not Turing machines because real computers do not have 
infinite tapes and the possibility of removeable tapes doesn't make any difference. 
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But ^(6* (start, w)) = h(~f(s)) — h(. . . ji(6* (start i, wi)) . . .) for some Wi. All we 
have to show is that 

6* (start, w) — (. . . S*(starti, Ui(w)) . . .) (6) 

and then we have 

j (S* (start, w)) = h(. . ."^(8* (start l ,u i (w))) . . .). 

It follows immediately that 

y(5* '(start, w)) = h(. . . fi(ui(w))) ...) = f(w) 

Equation [5] can be proved by induction on w. Since Ui(A) = A the base case is 
obvious. Now suppose that equation [5] is correct for w and consider wa. 

Let 5 (start, w) = s = (s±... ,s n ) and let Ui(w) = Zi. Then, by the induc- 
tion hypothesis = 8*(starti, zi), and, by the argument above ^(5* (start, w)) — 
f(w). So: 

5* (start, wa) = 6(6* (start, w), a) (7) 
= 6(s,a) (8) 
= (■ ■ ■S*(s i ,g(i,a,j(s))) . . .) (9) 

= (• • ■&i(5*( start i u i(' w ))i9(h a J(w))) ■ ■ •) (10) 
= (...S*(start,Ui(w)og(i,a, f(w)))...) (11) 
= (...6*(start,Ui(wa))...) (12) 

proving [B] for wa. 

It follows directly that if M is represented by /, and / is defined by simulta- 
neous recursion, then / can also be defined by single recursion — although such 
a definition may be impractical because of the large state set size. 

3 More on Representation and Some Algebra 

A number of results follow from theorem [T] 

Theorem 2 For M and f constructed as products as above in theorem [7J 

— There are an infinite number of distinct products M' — Af =i [N{, gi] so that 
f represents M' as well as M . 

— If all of the Mi are finite state, M is finite state (by construction) . 

— If all of the ft are finite state, f is finite state ( since it represents a finite 
state Moore machine). 

— If f is finite state then there is some M' = A\ =1 r\Zi,g, h] where f represents 
M' and each Zi is a 2 state Moore machine. In fact k = [log 2 (|S , A/' |)] . This 
is simple binary encoding. 
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3.1 Monoids 

If / : A* — > X then say w =/ u iff f(z o w o y) = f(z o u o y) for all z,y £ A* . 
Let [w]/f = {u E A*,u = w}. Then define [w]/f ■ [z]/f = [w o z]//. The set 
of these classes with • comprises a monoid where [u>] // ■ [A] /j = [w] / f for the 
required identity. Say that this monoid is the monoid determined by /. Recall 
the construction of states from string functions above and the set Sf consisting 
of all the functions f w so that f w (z) = f(w o z). Note that if v, z £ [w]/f it must 
be the case that for any string r f roz — f rov . So it is possible to associate each 
[w)/f with a map from Sf — > Sf where f r h-> f roz for any z in [w]/f. As a result, 
whenever Sf is finite, there are only a finite number of maps Sf — > Sf so the 
monoid determined by / must also be finite. 

Suppose f(w) = h(fi(ui) . . . , f n (u n )) so that Ui(wa) — Ui(w) o Zi where z, 
only depends on the feedback from factors indexed by j < i. That is, there are 
ri . . .r n so that z\ = r\{a) and Zi+i — rj+i(a, /(w, 1) . . . , /(w, i)). In this case / 
is constructed in cascade where information flows only in one direction and the 
results of Krohn- Rhodes theory [4l3j will apply. 

If / is finite and represents a state machine with k states and each of the /, 
are finite with ki states in the represented state machine, then if E J ^"kj < k the 
factorization is an implementation of / by essentially simpler string functions — 
and it corresponds to a factorization of the monoid of / into simpler monoids. 

Let T n (A) = and T n (wa) = T{w) + 1 mod n. Now define G„ as a cascade 
of T2's as follows: 



This is called a "ripple carry adder" in digital circuit engineering: each counter 
increments only if the "carry" is propagating through all lower order counters. 
Put H n (w) = Zl~™T 2 (u l ) x T- 1 where the Ui are as defined for G n . Then 
H n = Ti^ and you cannot make a G n which counts mod any number other then 
2™. Otherwise, the underlying monoid of T/~ has a simple group factor (a prime 
cyclic group) and those cannot be factored into smaller elements without some 
feedback. 

While the cascade decompositions may simplify the interconnect in one way, 
they do not necessarily indicate the most efficient or interesting decomposition 
in practice. Cascades are good designs for "pipelined" execution but may be slow 
if we have to wait for the data to propagate to the terminal element. And group 
qualities in data structures can correspond to "undo" properties. For example, 
consider a circular buffer - like those commonly used for UNIX type fifos/pipes. 
The idea is that "write" operations push data into the pipe and "read" operations 
remove data in order of the "writes". The memory used to hold the data is 
allocated in a cycle. One way to implement such a buffer is to decompose it into 
an array of k memory locations and a mod k counter. A write operation causes 




(13) 
(14) 



(15) 
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an increment of the counter and a store of data in the appropriate memory 
location. The increment has an inverse, the write does not. But the result is 
that a write can be "forgotten". Perhaps factoring off group-like components 
will reveal other possibilities for this type of partial inverse. 
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